Picture for Wen Huang

Wen Huang

RL-VLA$^3$: Reinforcement Learning VLA Accelerating via Full Asynchronism

Add code
Feb 05, 2026
Viaarxiv icon

ERNIE 5.0 Technical Report

Add code
Feb 04, 2026
Viaarxiv icon

A Data-Centric Approach to Generalizable Speech Deepfake Detection

Add code
Dec 24, 2025
Figure 1 for A Data-Centric Approach to Generalizable Speech Deepfake Detection
Figure 2 for A Data-Centric Approach to Generalizable Speech Deepfake Detection
Figure 3 for A Data-Centric Approach to Generalizable Speech Deepfake Detection
Figure 4 for A Data-Centric Approach to Generalizable Speech Deepfake Detection
Viaarxiv icon

Improving Deepfake Detection with Reinforcement Learning-Based Adaptive Data Augmentation

Add code
Nov 10, 2025
Viaarxiv icon

From Sharpness to Better Generalization for Speech Deepfake Detection

Add code
Jun 13, 2025
Viaarxiv icon

Generalizable Audio Deepfake Detection via Latent Space Refinement and Augmentation

Add code
Jan 24, 2025
Viaarxiv icon

Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification

Add code
Oct 22, 2024
Figure 1 for Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification
Figure 2 for Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification
Figure 3 for Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification
Figure 4 for Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification
Viaarxiv icon

Unified Audio Event Detection

Add code
Sep 13, 2024
Figure 1 for Unified Audio Event Detection
Figure 2 for Unified Audio Event Detection
Figure 3 for Unified Audio Event Detection
Figure 4 for Unified Audio Event Detection
Viaarxiv icon

Riemannian Federated Learning via Averaging Gradient Stream

Add code
Sep 11, 2024
Viaarxiv icon

Visual Hallucinations of Multi-modal Large Language Models

Add code
Feb 22, 2024
Figure 1 for Visual Hallucinations of Multi-modal Large Language Models
Figure 2 for Visual Hallucinations of Multi-modal Large Language Models
Figure 3 for Visual Hallucinations of Multi-modal Large Language Models
Figure 4 for Visual Hallucinations of Multi-modal Large Language Models
Viaarxiv icon